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Preface 

This paper grew out of three tutorial lectures on automatic structures given 
at the Logic Colloquium 2007 in Wroclaw (Poland). The paper will follow the 
outline of the tutorial lectures, supplementing some material along the way. We 
discuss variants of automatic structures related to several models of computation: 
word automata, tree automata, Biichi automata, and Rabin automata. Word 
automata process finite strings, tree automata process finite labeled trees, Biichi 
automata process infinite strings, and Rabin automata process infinite binary 
labeled trees. Finite automata are the most commonly known in the general 
computer science community. Such an automaton reads finite input strings from 
left to right, making state transitions along the way. Depending on its last state 
after processing a given string, the automaton either accepts or rejects the input 
string. Automatic structures are mathematical objects which can be represented 
by (word, tree, Biichi, or Rabin) automata. The study of properties of automatic 
structures is a relatively new and very active area of research. 

We begin with some motivation and history for studying automatic struc- 
tures. We introduce definitions of automatic structures, present examples, and 
discuss decidability and definability theorems. Next, we concentrate on finding 
natural isomorphism invariants for classes of automatic structures. These classes 
include well-founded partial orders, Boolean algebras, linear orders, trees, and 
finitely generated groups. Finally, we address the issue of complexity for auto- 
matic structures. In order to measure complexity of automatic structures we 
involve topology (via the Borcl hierarchy), model theory (Scott ranks), com- 
putability theory (Z'^-completeness), and computational complexity (the class 
P)- 

This paper consists of three sections based on the tutorial lectures. The first 
lecture provides motivating questions and historical context, formal definitions of 
the different types of automata involved, examples of automatic structures, and 
decidability and definability results about automatic structures. The second lec- 
ture considers tree and Rabin automatic structures, and outlines techniques for 
proving non-automaticity. We then turn our attention to the study of algorith- 
mic and structural properties of automatic trees, Boolean algebras, and finitely 



generated groups. The final lecture presents a framework for reducing certain 
questions about computable structures to questions about automatic structures. 
These reductions have been used to show that, in some cases, the sharp bound 
on complexity of automatic structures is as high as the bounds for computable 
structures. We conclude by looking at Borel structures from descriptive set the- 
ory and connecting them to Biichi and Rabin automatic structures. 

1 Basics 

1.1 Motivating questions 

The study of structures has played a central role in the development of logic. 
In the course of this study, several themes have been pursued. We will see how 
questions related to each of these themes is addressed in the particular case of 
automatic structures. 

The isomorphism problem. One of the major tasks in the study of structures 
is concerned with the classification of isomorphism types. The isomorphism prob- 
lem asks: "given two structures, arc they isomorphic?'' Ideally, we would like to 
define invariants which are simpler than their associated structures and yet de- 
scribe the structures up to isomorphism. An example of such an invariant is the 
dimension of vector spaces. For algebraically closed fields, such invariants are 
the characteristics of the fields along with their transendence degrees. Despite 
this, there are well-studied classes of structures (such as abelian groups. Boolean 
algebras, linearly ordered sets, algebras, and graphs) for which it is impossible 
to give simple isomorphism invariants. Thus, in general, there is no solution to 
the isomorphism problem. 

The elementary equivalence problem. Since there is no general solution to 

the isomorphism problem, we may approximate it in various ways. One natural 
approximation comes from logic in the form of elementary equivalence. While 
elementary equivalence may be defined with respect to any logic, wc refer to el- 
ementary equivalence with respect to either the first-order logic or the monadic 
second-order (MSO) logic. There are several good examples of positive results 
in this setting: elementarily equivalent (in the first-order logic) Boolean algebras 
can be characterized via Ershov-Tarski invariants [17], [57]; elementarily equiv- 
alent abelian groups (in the first-order logic) can be characterized via Shmelev 
invariants [54]. Moreover, there is a body of work devoted to understanding when 
elementary equivalence (or any of its weaker versions) imply isomorphism. An 
example here is Pop's conjecture: if two finitely generated fields are elementarily 
equivalent then they are isomorphic. See [55] for a solution of this problem. 

The model checking problem. The model checking problem is the name given 
by the computer science community to a particular instance of the elementary 
equivalence problem. The problem asks, for a given sentence and a given 
structure, whether the structure satisfies ip. For example, the sentence (p can be 
the axiomatization of groups or can state that a graph is connected (in which 
case a stronger logic is used than the first-order logic). 



Deciding the theories of structures. This is a traditional topic in logic that 
socks algorithms to decide the full first-ordc;r theory (or MSO theory) of a given 
structure. Clearly, the problem can be thought of as a uniform version of the 
model checking problem. Often, to prove that a structure has a decidable first- 
order theory, one translates formulas into an equivalent form that has a small 
number of alternations of quantifiers (or no quantifiers at all), and then shows 
that the resulting simpler formulas are easy to decide. This approach is inti- 
mately related to the next theme in our list. 

Characterization of definable relations. Here, we would like to understand 
properties of definable relations in a given structure. A classical example is real 
closed fields, where a relation is definable in the first-order logic if and only if it 
is a Boolean combination of polynomial equations and inequations [58] . Usually, 
but not always, a good understanding of definable relations yields a quantifier 
elimination procedure and shows that the theory of the structure is decidable. 

It is apparent that all of the above problems are interrelated. We will dis- 
cuss these problems and their refinements with respect to classes of automatic 
structures. 

1.2 Background 

The theory of automatic structures can be motivated from the standpoint of 
computable structures. The theory of computable structures has a rich history 
going back to van der Waerden who informally introduced the concept of an 
explicitly given field in the 1930s. This concept was formalized by Frolich and 
Shepherdson [18], and later extended by Rabin [48] in the 1950s. In the 1960s, 
Mal'cev initiated a systematic study of the theory (see, for example [40]). Later, 
computability theoretic techniques were introduced in order to study the effective 
content of algebraic and model theoretic results and constructions. See [21], [22] 
for the current state and historical background of the area. 

A computable structure is one whose domain and basic relations are all com- 
putable by Turing machines. As a point of comparison with finite automata, we 
note that the Turing machine represents unbounded resources and unbounded 
read-passes over the data. The themes outlined in Subsection 1.1 have been 
recast with computability considerations in mind and have given rise to the de- 
velopment of the theory of computable structures. For example, one may ask 
whether a given structure is computable. This is clearly a question about the 
isomorphism type of the structures since it asks whether the isomorphism type of 
the structure contains a computable structure. So, the study of the computable 
isomorphism types of structures is a refinement of the isomorphism problem for 
structures. Another major theme is devoted to understanding the complexity of 
natural problems like the isomorphism problem and the model checking prob- 
lem. In this case, complexity is often measured in terms of Turing degrees. We 
again refer the reader to [21], [22] and the survey paper [23]. 

In the 1980s, as part of their feasible mathematics program, Nerode and Rem- 
mel [43] suggested the study of polynomial- time structures. A structure is said 



to be polynomial-time if its domain and relations can be recognized by Turing 
machines that run in polynomial time. An important yet simple result (Cenzer 
and Remmel, [12]) is that every computable purely relational structure is com- 
putably isomorphic to a polynomial-time structure. While this result is positive, 
it implies that solving questions about the class polynomial-time structures is 
as hard as solving them for the class of computable structures. For instance, the 
problem of classifying the isomorphism types of polynomial-time structures is as 
hard as that of classifying the isomorphism types of computable structures. 

Since polynomial-time structures and computable structures yielded similar 
complexity results, greater restrictions on models of computations were imposed. 
In 1995, Khoussainov and Nerode suggested bringing in models of computations 
that have less computational power than polynomial-time Turing machines. The 
hope was that if these weaker machines were used to represent the domain and 
basic relations, then perhaps isomorphism invariants could be more easily un- 
derstood. Specifically, they suggested the use of finite state machines (automata) 
as the basic computation model. Indeed, the project has been successful as we 
discuss below. 

The idea of using automata to study structures goes back to the work of 
Biichi. Biichi [10], [11] used automata to prove the decidability of of a theory 
called SIS (monadic second-order theory of the natural numbers with one suc- 
cessor). Rabin [49] then used automata to prove that the monadic sc;(;ond-ordor 
theory of two successor functions, 525, is also decidable. In the realm of logic, 
these results have been used to prove decidability of first-order or MSO theories. 
Biichi knew that automata and Presburger arithmetic (the first-order theory of 
the natural numbers with addition) are closely connected. He used automata to 
give a simple (non quantifier elimination) proof of the decidability of Presburger 
arithmetic. Capturing this notion, Hodgson [27] defined automata decidable the- 
ories in 1982. While he coined the definition of automatic structures, throughout 
the 1980s his work remained unnoticed. In 1995, Khoussainov and Nerode [29] 
rediscovered the concept of automatic structure and initiated the systematic 
study of the area. 

Thurston observed that many finitely generated groups associated with 3- 
manifolds are finitely presented groups with the property that finite automata 
recognize equality of words and the graphs of the operations of left multipli- 
cation by a generator; these are the Thurston automatic groups. These groups 
yield rapid algorithms [15] for computing topological and algebraic properties of 
interest (such as the word problem). In this development, a group is regarded 
as a unary algebra including one unary operation for each generator, the opera- 
tion being left multiplication of a word by that generator. Among these groups 
are Coxeter groups, braid groups, Euclidean groups, and others. The literature 
is extensive, and we do not discuss it here. We emphasize that Thurston au- 
tomatic groups differ from automatic groups in our sense; in particular, the 
vocabulary of the associated structures is starkly different. Thurston automatic 
groups are represented as structures whose relations are all unary operations 
(corresponding to left multiplication). On the other hand, an automatic group 



in our sense deals with the group operation itself (a binary relation) and hence 
must satisfy the constraint that the graph of this operation be recognizable by 
a finite automaton. The Thurston requirement for automaticity applies only to 
finitely generated groups but includes a wider class of finitely generated groups 
than what we call automatic groups. Unlike our definition, Thurston's includes 
groups whose binary operation of multiplication is not recognizable by a finite 
automaton. 

In the computer science community, an interest in automatic structures comes 
from problems related to model checking. Model checking is motivated by the 
quest to prove correctness of computer programs. This subject allows infinite 
state automata as well as finite state automata. Consult [1], [2], [3] for cur- 
rent topics of interest. Examples of infinite state automata include concurrency 
protocols involving arbitrary number of processes, programs manipulating some 
infinite sets of data (such as the integers or reals), pushdown automata, counter 
automata, timed automata, Petri-nets, and rewriting systems. Given such an 
automaton and a specification (formula) in a formal system, the model checking 
problem asks us to compute all the states of the system that satisfy the specifi- 
cation. Since the state space is infinite, the process of checking the specification 
may not terminate. For instance, the standard fixed point computations very of- 
ten do not terminate in finite time. Specialized methods are needed to cover even 
the problems encountered in practice. Abstraction methods try to represent the 
behavior of the system in finite form. Model checking then reduces to checking a 
finite representation of states that satisfy the specification. Automatic structures 
arise naturally in infinite state model checking since both the state space and the 
transitions of infinite state systems are usually recognizable by finite automata. 
In 2000, Blumensath and Gradel [9] studied definability problems for automatic 
structures and the computational complexity of model checking for automatic 
structures. 

There has been a series of PhD theses in the area of automatic structures (e.g. 
Rubin [52], Blumensath [8], Barany [4], Minnes [41], and Liu [39]). A recently 
published paper of Khoussainov and Nerode [30] discusses open questions in 
the study of automatic structures. There are also survey papers on some of the 
areas in the subject by Nies [44] and Rubin [53] . This avenue of research remains 
fruitful and active. 

1.3 Automata recognizable languages and relations 

The central models of computation in the development of the theory of automatic 
structures are all finite state machines. These include finite automata, Biichi 
automata, tree automata, and Rabin automata. The main distinguishing feature 
of the difi^erent kinds of automata is the kind of input they read. 

Definition 1. A Biichi automaton Ai is a tuple {S, i, A, F), where S is a 
finite set of states, l is the initial state, A C S x S x S (with S a finite alphabet) 
is the transition table, and F C S is the set of accepting states. 



A Biichi automaton can be presented as a finite directed graph with labelled 
edges. The vertices of the graph arc the states of the automaton (with designated 
initial state and accepting states). An edge labeled with a connects a state g to a 
state q' if and only if the triple (q, cr, q') is in Zi. The inputs of a Biichi automaton 
are infinite strings over the alphabet S. Let a = aoai ... be such an infinite 
string. The string labels an infinite path through the graph in a natural way. 
Such a path is called a run of the automaton on a. Formally, a run is an infinite 
sequence qo,qi, . . . , of states such that qo is the initial state and {qt, CTj, gj+i) G A 
for all i & CO. The run is accepting if some accepting state appears in the run 
infinitely often. Note that the automaton may have more than one run on a 
single input a. We say that the Biichi automaton A4 accepts a string a if A4 
has an accepting run on a. Thus, acceptance is an existential condition. 

Definition 2. The set of all infinite words accepted by a Biichi automaton A4 
is called the language of M. A collection of infinite words is called a Biichi 
recognizable language if it is the language of some Biichi automaton. 

Example 1. The following two languages over S = {0, 1} are Biichi recognizable: 

— {a I a has finitely many Is}, 

— {a I a has infinitely many Is and infinitely many Os}. 

There arc efficient algorithms to decide questions about Biichi recognizable 
languages. A good reference on basic algorithms for Biichi automata is Thomas' 
survey paper [60]. A central building block in these algorithms is the linear- 
time decidability of the emptiness problem for Biichi automata. The emptiness 
problem asks for an algorithm that, given a Biichi automaton, decides if the 
automaton accepts at least one string. An equally important result in the devel- 
opment of the thc^ory of Biichi automata says that Biichi recognizable languages 
are closed under union, intersection, and complementation. 

Theorem 1 (Biichi; 1960). 

1. There is an algorithm that, given a Biichi automaton, decides (in linear 
time in the size of the automaton) if there is some string accepted by the 
automaton. 

2. The collection of all Biichi recognizable languages is closed under the opera- 
tions of union, intersection, and complementation. 

Proof. The first part of the theorem is easy: the Biichi automaton accepts an 
infinite string if and only if there is a path from the initial state to an accepting 
state which then loops back to the accepting state. Thus, the emptiness algorithm 
executes a breadth-first search for such a "lasso" . Closure under union and 
intersection follows from a standard product construction. It is considerably 
more difficult to prove closure under complementation. See [60] for a discussion 
of the issues related to the complementation problem for Biichi automata. □ 



Theorem 1 is true eflFectively: given two automata we can construct an au- 
tomaton that recognizes all the strings accepted by either of the automata (the 
union automaton), we can also construct an automaton that recognizes all the 
strings accepted by both automata (the intersection automaton). Likewise, there 
is an algorithm that given an automaton builds a new automaton (called the 
complement automaton) that accepts all the strings rejected by the original 
automaton. We emphasize that complementation constructions have deep sig- 
nificance in modern automata theory and its applications. 

Since we will be interested in using automata to represent structures, wc need 
to define what it means for a relation to be recognized by an automaton. Until 
now, we have discussed Biichi automata recognizable sets of infinite strings. 
It is easy to generalize this notion to relations on E'^. Basically, to process 
a tuple of infinite strings, we read each string in parallel. More formally, we 
define the convolution of infinite strings ai , . . . , afc G as the infinite string 
c(q!i, . . . , Qffe) G i^'^) whose value at position i is the tuple {ai{i), . . . , ak{i)). 
The convolution of a A:-ary relation R, denoted by c{R), is the set of infinite 
strings over S'^ which are the convolutions of tuples of R. We say that the 
relation R is Biichi recognizable if and only if the sc^t c{R) is Biichi recognizable. 

We will now describe some operations which preserve automaticity. Let A 
be a language of infinite strings over E and Rhe a relation of arity k on E'^. 
The cylindrification operation on a relation R (with respect to A) produces 
the new relation 

cyl{R) = {(ai, . . . , ttfe, a) : (ai, . . . , Ofe) G R and a € A}. 

The projection, or 3, operation (with respect to A) is defined by 

3xiR = {(ai, . . . , aj_i, flj+i, . . . , Ofe) : 3a e A{{ai, a^-i, a, aj+i, . . . , afe) e R)}. 

The universal projection, or V, operation (with respect to A) is defined as 

yxiR = {{ai, . . .,ai-i,ai+i, . . . ,afe) : Va e A{{ai, . . . , a^-i, o, Oj+i, . . . ,afe) e R)}. 

In all these operations {cyl, 3, V), if R and A are both Biichi recognizable then 
the resulting relations are also Biichi recognizable. The instantiation operation 

is defined by 

/(i?, c) = {{xi,...,Xk-i) : {xi,...,Xk-i,c) e R}. 

If R is Biichi automatic then I{R, c) is Biichi automatic if and only if c is an 
ultimately periodic infinite string. An ultimately periodic word is one of the form 
uv^ = uvvv ■ ■ ■ where u and v are finite strings. The rearrangement operations 
permute the coordinates of a relation. If tt : fc — » /c is a permutation on the set 
of k elements and i? is a fc-ary Biichi automatic relation, then 

ttR = {{x^(i),. . ■,x^(k)) ■■ {xi, ...,Xk)eR} 

is Biichi automatic. The linkage operation identifies the last coordinates of some 
relation with the first coordinates of another. Given the relations R (of arity mi) 



and S (of arity 7712) and index i < mi, the linkage of R and 5 on i is the relation 
of arity 1712 + i — 1 defined by 

L{Er\S'^^;i) = {(ai,...,a™,+i_i) : {ai,...,am^) e i?& (oi, . . . ,a„,+i_i) e S}. 

For example, i(i?^,5''; 2) = {(01,02,03,04,05) : (01,02,03) G i? & (02,03,04,05) e 
S*}. If i? and 5* are Biichi recognizable relations then so is L{R, S; i). 

The closure of Biichi recognizable relations under Boolean operations (The- 
orem 1) connects Biichi recognizability and propositional logic. The 3 and V 
operations bring us to the realm of first-order logic. We now connect automata 
with the monadic second-order (MSO) logic of the successor structure {u!;S). 
The MSO logic is built on top of the first-order logic as follows. There is one 
non- logical membership symbol S, and there are set variables X,Y, . . . that range 
over subsets of uj. Formulas are defined inductively in a standard way via the 
Boolean connectives and quantifiers over set variables as well as over individual 
variables. 

Example 2. In MSO of {u);S), the following relations and properties are ex- 
pressible: the subset relation X CY, the natural order <, finiteness of sets, and 
whether a set is a singleton. For example, 



The definability of singletons allows us to transform any MSO formula into 
an equivalent MSO formula all of whose variables are set variables. We can also 
interpret addition on natural numbers as follows. Associate with every finite set 
X the binary string ct{X) that has 1 at position i if and only if i G X. We use 
the rules of binary addition to express the statement that X,Y, Z are finite sets 
and a{X) +2 a{Y) = a{Z) in the MSO logic logic of (w; S). 

There is a natural correspondence between fc-ary relations on 7'(cj) (the power 
set of the natural numbers) and A:-ary relations on {0, 1}'^. For example, consider 
the case of fc = 2. Any binary relation R on V{u)) is a collection of pairs {X, Y) 
of subsets. Identify X and Y with their characteristic functions ax and ay, each 
of which is an infinite string over {0, 1}. Recall that the convolution of {X,Y) 
is an infinite string over {0,1}^ such that c{ax,OiY){i) = (ctxii), Oiyii))- The 
convolution of R is the language c{R) = {c{ax,ay) ■ {X,Y) £ R}. With this 
correspondence between relations on P{lj) and relations on {0, 1}", Biichi proved 
a general characterization theorem that links the MSO definable relations of the 
successor structure (w; S) and Biichi automata. 

Theorem 2 (Buchi, 1960). A relaMon R C V{lu)'' is definable in MSO logic 
if and only if R is Biichi recognizable. Given an MSO definable relation R, there 
is an algorithm which builds a Biichi automaton recognizing R. In particular, the 
MSO theory of {u; S), denoted as SIS, is decidable. □ 

We illustrate the theorem on a simple example: checking whether 3X\/Y R{X, Y) 
is true in (w; 5), where i? is a definable relation. Theorem 2 says that R is de- 
finable in MSO logic if and only if c{R) is Biichi recognizable. Using Theorem 




2, we build an automaton recognizing R. This is done inductively based on the 
complexity of the formula defining R. We then construct an automaton for the 
set of infinite strings {X : MY R{X, Y)}. We use Theorem 1 again to check if the 
set {X : VF R{X,Y)} is empty. If it is empty, the sentence is false; otherwise, 
it is true. Biichi's theorem is a quintessential example of how understanding the 
definable relations in a structure helps us decide the theory of the structure (see 
Subsection 1.1). 

We briefly turn our attention to word automata. The underlying graph 

definition of a word automaton is identical to that of a Biichi automaton. As 
mentioned earlier, the difference between these two models of computation lies 
in that word automata process finite strings rather than infinite strings. Let 
Ai = {S, i. A, F) be a word automaton over the finite alphabet E, and let v be 
a finite string over E. Then v labels a path in the underlying directed graph 
of M., starting from the initial state. This path is called a run of M. on input 
V. If the last state in the run is accepting then the nm is an accepting run. 
The automaton M accepts a string if there is some accepting run of M on the 
string. The collection of all finite strings accepted by a word automaton is called 
a regular (or. cquivalcntly, a FA recognizable) language. As in the setting of 
Biichi automata, we have the following theorem: 

Theorem 3 (Kleene, 1951). 

1. There is an algorithm that, given a word automaton, decides (in linear time 
in the size of the automaton) if some string is accepted by the automaton. 

2. The collection of all regular languages is closed under the operations of union, 
intersection, and complementation. 

Example 3. The following sets of strings are regular languages. 

— {twlOl : w e {0, 1}* has no sub-word of the form 101} 

- {w : w E {0, 1}* is a binary representation of some positive integer with the 

least significant bit first}. 

Example 4- Regular languages can be naturally embedded into Biichi recogniz- 
able sets. That is, W is regular if and only if WO'^ is Biichi recognizable (where 
O is a new symbol). 

Previously, we defined Biichi recognizable relations. We would like an anal- 
ogous notion for word automata. To define regular relations, we need to define 
the convolution of finite strings. Let xi,. . . ,Xk 6 E*. U Xi, . . . ,Xk are all of 
the same length, the i*'* element of the convolution c{xi, . . . ,Xk) is the tuple 
(xi(i), . . . ,Xk{i)) (as in the infinite string case). Otherwise, assume without loss 
of generality that Xn is the longest string among xi,. .. ,Xk. For each m ^ n, 
append a new symbol O to the end of Xm as many times as necessary to make 
the padded version of x^ have the same length as x„. Then c(xi, . . . , x^) is the 
convolution of the padded strings. If i? is a A;-ary relation on S* , R is called 
regular if its convolution c{R) is a regular language. 



As before, we can develop a calculus of regular relations. Given a regular 
relation R and regular set A, the cylindrification c{R), projection 3xiR, and 
universal projection Vxji? are all recognizable by finite automaton. Likewise, 
given a regular relation R and any finite string c, the instantiation I{R, c) is 
a regular relation. Similarly, the rearrangement and linkage operations preserve 
regularity of relations. 

1.4 Biichi and word automatic structures 

The main focus of this tutorial is the study of structures defined by automata. 
We now give a formal definition of this concept and provide several examples 
of such structures. Recall that a structure .A is a tuple {A; . . . , Rm) where 
A is a non-empty set called the domain and Ri, . . ., Rm are basic (or atomic) 
relations on A. 

Definition 3. A structure is word automatic if its domain and basic relations 
are regular. A structure is Biichi automatic if its domain and basic relations 
are Biichi recognizable. 

Often, we refer to word automatic structures and Biichi automatic structures 
simply as automatic structures. The type of automaticity will be clear from the 
context. We present some examples of word automatic structures. We begin with 
structures whose domains are {!}* (automatic structures over the one letter 
alphabet {1} are called unary automatic structures; they have been studied in 
[8], [52], [34]). 

Example 5. The structure (1*; <, S), where 1" < 1" <s=^ m < n and 5(1") = 
1"+"'^, is word automatic. 

Example 6. The structure (1*; mod i, mod 2, • • • , mod „), where n is a fixed 
positive integer, is word automatic. The word automata recognizing the modular 
relations contain cycles of appropriate lengths. 

Next, we move to structures with a binary alphabet {0, 1}. It is not too hard 
to sec that any automatic structure over a finite alphabet is isomorphic to an 
automatic structure over a binary alphabet [52]. Clearly, any word automatic 
structure has a countable domain. 

Example 7. The structure ({0, 1}*; V, A, -■) is word automatic because bit-wise 
operations on binary strings can be recognized by finite automata. 

Example 8. Presburger arithmetic, the structure ({0, 1}* • 1; +2, <), where +2 is 
binary addition if the binary strings are interpreted as the least significant bit 
first base-2 expansion of natural numbers. The usual algorithm for adding binary 
numbers involves a single carry bit, and therefore a small word automaton can 
recognize the relation +2. 



Example 9. Instead of Presburger arithmetic, we may consider the structure 
({0, !}*■ 1; +2, 12)- This is arithmetic with weak divisibility: w {2 v iiw represents 
a power of 2 which divides the number represented by v. Since we encode natural 
numbers by their binary representation, weak divisibility is a regular relation. 

Example 10. If we treat binary strings at face value rather than as representa- 
tions of natural numbers, we arrive at a different automatic structure: 

{{0,iy;^, Left, Right, EqL), 

where ^ is the prefix relation, Left and Right denote the functions which append 
a or 1 to the binary string (respectively), and EqL is the equal length relation. 
It is easy to show that this structure is automatic. We will see later that this 
structure has a central role in the study of automatic structures. 

Example 11. A useful example of a word automatic structure is the configuration 
space of a Turing machine. The configuration space is a graph whose nodes are 
the configurations of the machine (the state, the contents of the tape, and the 
position of the read/write head). An edge exists between two nodes if there is 
a one-step transition of the machine which moves it between the configurations 
represented by these nodes. 

Exam,ple 12. Any word automatic structure is I3iic;lii automatic, but the converse 
is not true. In the next subsection, we will see examples of Biichi automatic 
structures which have uncountable domains. These structures cannot be word 
automatic. 

We mention an automata theoretic version of the theorem of Lowenheim and 
Skolem. Recall that the classical theorem states that any infinite structure over a 
countable language has a countable elementary substructure. Recall that an ele- 
mentary substructure is one which has the same first-order theory as the original 
structure in the language expanded by naming all the elements of the substruc- 
ture. For Biichi automatic structures, the analogue of a countable substructure 
is the substructure consisting of ultimately periodic words. 

Theorem 4 (Hjorth, Khoussainov, Montalban, Nies; 2008). Let A he a 

Biichi automatic structure and consider the substructure A! whose domain is 

A' = {a & A : a is ultimately periodic]. 

Then A' is a computable elementary substructure of A. Moreover, there is an 
algorithm which from a first-order formula (p{a, x) with parameters a from A' 
produces a Biichi automaton which accepts exactly those tuples x of ultimately 
periodic words which make the formula true in the structure. 

This theorem has also been independently proved by Barany and Rubin. 



1.5 Presentations and operations on automatic structures 



The isomorphism type of a structure is the class of all structures isomorphic to 
it. We single out those isomorphism types that contain automatic structures. 

Definition 4. A structure A is called (word or Biichi) automata presentable 

if it is isomorphic to some (word, or Biichi) automatic structure B. In this case, 
B is called an automatic presentation of A. 

We sometimes abuse terminology and call automata presentable structures 
automatic. Let B = {D^; Rf, . . . , i?f ) be an automatic presentation of A. Since 
B is automatic, the sets _D®, i?,f , . . . , f?f arc all recognized by automata, say by 
A4,Mi, . . . ,A4s- Often the automatic presentation of A is identified with the 
finite sequence A4,A4i, . . . ,A4s of automata. Prom this standpoint, automata 
presentable structures have finite presentations. 

Examples of automata presentable structures arise as specific mathematical 
objects of independent interest, or as the result of closing under automata pre- 
sentability preserving relations. For example, if a class of structures is defined 
inductively and we know that the base structures have automata presentations 
and that each of the closure operations on the class preserve automaticity, then 
we may conclude that each member of the class is automata presentable. To aid 
in this strategy, we present some automaticity preserving operations on struc- 
tures. The automaton constructions are often slight modifications of those used 
to show that recognizable sets form a Boolean algebra. 

Proposition 1. If A and B are (word or Biichi) automatic structures then so 
is their Cartesian product A x B. If A and B are (word or Biichi) automatic 

structures then so is their disjoint union A + B. 

Proposition 2. If A is word automatic and E is a regular equivalence relation, 
then the quotient A/ E is word automatic. 

Proof. To represent the structure A/E, consider the set 

Rep={xCiA\ yy(y <„ex x k y G (x^y) ^ E)}, 

where <iiex is the length-lexicographical linear order. Recall that for any finite 
alphabet, the length lexicographic order is defined as x <iiex V if and only if 
\x\ < \y\ or \x\ = \y\ and \x\ <iex \y\. Since Rep is first-order definable in (A,E), 
it is regular and constitutes the domain of the quotient structure. Restricting the 
basic relations of A to the set Rep yields regular relations. Hence, the quotient 
structure is automatic. □ 

A straightforward analysis of countable Biichi recognizable languages shows 
that a countable structure has a word automatic presentation if and only if 
it has Biichi automatic presentation. It is natural to ask whether Proposition 2 
can be extended to countable Biichi automatic structures. This has recently been 
answered positively in [5] by a delicate analysis of Biichi recognizable equivalence 
relations with countably many equivalence classes. 



Theorem 5 (Barany, Kaiser, Rubin; 2008). For a Biichi automatic struc- 
ture A and a Biichi recognizable equivalence relation E with countably many 
equivalence classes, the quotient structure Aj E is Biichi automatic. 

A long-standing open question had asked whether Biichi automatic structures 
behave as nicely as word automatic structures with respect to Biichi recognizable 
equivalence relations. In other words, whether the countability assumption can 
be removed from the theorem above. A counterexample in [25] recently settled 
this question. We will outline a proof of the following theorem in the last lecture. 

Theorem 6 (Hjorth, Khoussainov, Montalban, Nies; 2008). The class 
of Biichi automatic structures is not closed under the quotient operation with 
respect to Biichi recognizable equivalence relations. 

We now give some natural examples of automata presentable structures. Note 

that many of the automatic structures that we mentioned earlier arise as the 
presentations of the following automata presentable structures. 

Example 13. The natural numbers under addition and order have a word au- 
tomata presentation. The automatic presentation here is the word structure 
({0,l}*-l;+2,<). 

Example I4. The real numbers under addition have a Biichi automata presen- 
tation. The presentation is ({0, 1}* • {★} • {0, l}'^; -^2). 

Example 15. Finitely generated abelian groups are all word automata presentable, 
Recall that a group Q is finitely generated if there is a finite set S such that Q 
is the smallest group containing S; a group Q is called abelian if the group op- 
eration is commutative a ■ b = b ■ a. Since every such group is isomorphic to 
a finite direct sum of (Z; and (Z„;-|-) [51], and since each of these has an 
automatic presentation, any finitely generated abelian group has an automatic 
presentation. 

Example 16. The Boolean algebra of finite and co-finite subsets of iv is word 
automata presentable. An automatic presentation is the structure whose domain 
is {0, 1}*U{2, 3}* where words in {0, 1}* represent finite sets and words in {2, 3}* 
represent cofinite sets. 

Example 17. The Boolean algebra of all subsets of u is Biichi automata pre- 
sentable. A presentation using infinite strings treats each infinite string as the 
characteristic function of a subset of w. The union, intersection, and comple- 
mentation operations act bitwise on the infinite strings and are recognizable by 
Biichi automata. 

Example 18. The linear order of the rational numbers (Q; <) has a word au- 
tomata presentation: ({0, 1}* ■ 1; <iex), where u <iex v is the lexicographic or- 
dering and holds if and only if -u is a prefix of v or u = wOx and v = wly for 
some w,x,y € {0, 1}*. 



1.6 Decidability results for automatic structures 



Biichi's theorem in Subsection 1.3 uses automata to prove the decidabihty of 
the MSO theory of the successor structure (w; S). We now explore other decid- 
abihty consequences of algorithms for automata. The foundational theorem of 
Khoussainov and Nerode [29] uses the closure of regular relations (respectively, 
Biichi recognizable relations) under Boolean and projection operations to prove 
the decidability of the first-order theory of any automatic structure. 

Theorem 7 (Khoussainov, Nerode; 1995. Blumensath, Gradel; 2000). 

There is an algorithm that, given a (word or Biichi) automatic structure A and a 
first- order formula ^{x\, . . . , a;„), produces an automaton recognizing those tuples 
(ai, . . . , a„) that make the formula true in A. 

Proof. We go by induction on the complexity of the formula, using the fact that 
automata recognizable relations are closed under the Boolean operations and 
the projection operations as explained in Subsection 1.3. □ 

The Khoussainov and Nerode decidability theorem can be applied to indi- 
vidual formulas to yield Corollary 1, or uniformly to yield Corollary 2. 

Corollary 1. Let A be a word automatic structure and Lp{x) be a first-order 
formula. There is a linear-time algorithm that, given a G A, checks if ^{a) holds 
in A. 

Proof. Let A^p be an automaton for (p. Given a, the algorithm runs through the 
state space of A^ and checks if A^ accepts the tuple. This can be done in linear 
time in the size of the tuple. □ 

Corollary 2. [Hodgson; 1982] The first- order theory of any automatic structure 
is decidable. 

The connection between automatic structures and first-order formulas goes in 
the reverse direction as well. That is, first-order definability can produce new au- 
tomatic structures. We say that a structure B = {B; Ri, . . . i?„) is first-order de- 
finable in a structure A if there are first-order formulas (ps and (pi, . . . , tpn (with 
parameters from A) which define B, . . . , i?„ (respectively) in the structure 
A. Khoussainov and Nerode's theorem immediately gives the following result 
about first-order definable structures. 

Corollary 3. If A is (word or Biichi) automatic and B is first-order definable 
in A, then B is also (word or Biichi) automatic. 

In fact, automatic structures can yield algorithms for properties expressed in 
logics stronger than first-order. We denote by 3°° the "there are countably in- 
finitely many" quantifer, and by 3"'™ the "there are m many mod n" quantifiers. 
Then (FO + 3°° + 3"'™) is the first-order logic extended with these quantifiers. 
The following theorem from [32] extends the Khoussainov and Nerode theorem 
to this extended logic. We note that Blumensath noted the 3°° case first, in [8]. 



Theorem 8 (Khoussainov, Rubin, Stephan; 2003). There is an algorithm 
that, given a word automatic structure A and a (FO + 3°° + 3"''"^ formula 
ip{xi, . . . , Xn) with parameters from A, produces an automaton recognizing those 
tuples (ai, . . . , a„} that make the formula true in A. 

Corollary 4. The (FO + 3°° + 3"''"^) -theory of any word automatic structure 
is decidable. 

The next corollary demonstrates how the extended logic can be used in a 
straightforward way. 

Corollary 5. If L is a word automatic partially ordered set, the set of all pairs 
{x, y) such that the interval [x, y] has an even number of elements is regular. 

Another interesting application is an automata theoretic version of Konig's 
lemma. Recall that the classical version of Konig's Lemma says that every infinite 
finitely branching tree contains an infinite path. In Lecture 2, we discuss Konig's 
lemma in greater detail. 

Corollary 6. Let T = (T; <) he a word automatic infinite finitely branching 
tree. There exists an infinite regular path in T . 

Proof. We assume that the order < is such that the root of the tree is the least 
element. We use the auxiliary automatic relations <iiex (the length lexicographic 
order) to give a {FO + 3°°) definition of an infinite path of T. Recall that the 
length lexicographic order is defined as x <iiex U if and only if \x\ < \y\ or 
= \y\ and \x\ <iex \y\. 

P = {x: 3°°y{x <y) k ^{y < x)^{z z' & S{y))[z <x ^ z <uex A) 

In words, P is the left-most infinite path in the tree. The first clause of the 
definition restricts our attention to those nodes of the tree which have infinitely 
many descendants. Since T is finitely branching, these nodes are exactly those 
which lie on infinite paths. It is easy to see that the above definition guarantees 
that P is closed downward, linearly ordered, and finite. Moreover, it is regular 
by the decidability theorem. Theorem 7. □ 

Recently, Kuske and Lohrey generalized the decidability theorem for [FO + 
goo ^ gn,m^ Bvidii automatic structures [38]. 

1.7 Definability in automatic structures 

Throughout computer science and logic, there arc classifications of problems 
or sets into hierarchies. Some examples include time complexity, relative com- 
putability, and proof-theoretic strength. In each case, there is a notion of re- 
ducibility between members in the hierarchy. One often searches for a complete, 
or typical, member at each level of the hierarchy. More precisely, an element of 
the hierarchy is called complete for a particular level if it is in that level, and 



if all other elements of the level are reducible to it. We may view automatic 
structures as a complexity class. In that context, we would like to find com- 
plete structures. For this question to be well-defined, we must speciiy a notion 
of reducibility. In light of the results of the previous section, it seems natural to 
consider logical definability of structures as the notion of reducibility. For Biichi 
automatic structures, Biichi's theorem immediately gives a complete structure 
with respect to MSO definability. 

Corollary 7. A structure is Biichi automatic if and only if it is definable in the 
MSO logic of the successor structure {co;S). 

For word automatic structures, it turns out that first-order definability suf- 
fices. Blumensath and Gradel identify the following complete structure [9]. (Each 
of the basic relations in the following structure is defined in Subsection 1.3.) 

Theorem 9 (Blumensath, Gradel; 2000). A structure is word autom,atic if 

and only if it is fi,rst- order definable in the word structure ({0, 1}"^; ^, Left, Right, EqL). 

Proof. One direction is clear because automatic structure are closed under first- 
order definability. For the converse, suppose that A is an automatic structure. 
We will show that it is definable in ({0, 1}*; <, Left, Right, EqL). 

It suffices to show that every regular relation on {0, 1}* is definable in the 
word structure. Without loss of generality, assume that i is a language rec- 
ognized by the word automaton M. = {S, b, A,F). Assume S = {l,...,n} 
with I ~ L (the initial state). We define a formula ^'^^(■y) in the language of 
({0, 1}*; <, Left, Right, EqL) which will hold of the string v if and only if v is 
accepted by M.. We use the following auxiliary definable relations in our defini- 
tion of (pM ■ 

— Length order, \p\ < \x\, is defined by 3i/(t/ :< x h EqL{y,p))); 

— The digit test relation, the digit of x at position \p\ is 0, is defined by 
3y3ziz ^y^xk EqL{y,p) & Left{z,y))); 

— The distinct digits relation states that the digits of a;i and X2 at position \p\ 
are distinct. 

The following paragraph describing a run of A4 on input v can now be trans- 
lated into a single first-order sentence ipM with parameter v. "There are strings 
Xi, . . . ,Xn each of length \v\ + 1. For each p, if \p\ < |u| + 1, there is exactly one 
Xj with digit 1 at position \p\ (the strings Xi, . . . ,Xn describe which states we're 
in at a given position in the run). If Xi has digit 1 at position \p\, and a is the 
digit of V at position \p\, and A(i, a) =^ j then Xj has digit 1 at positions \p\ + 1. 
The digit in the first position of xi is 1. There is some Xj such that j ^ F and 
for which the digit in the last position is 1." Therefore, L is first-order definable 
in ({0, 1}*; :<, Left, Right, EqL). Since the domain and each of the relations of 
an automatic structure are regular, they are also first-order definable. □ 

If we use weak monadic second-order, WMSO, logic (where set quantification 
is only over finite sets) instead of first-order logic as our notion of reducibility, 



we arrive at a more natural complete structure. Blumensath and Gradel [9] 
show that, in this case, the successor structure; is complete. This result has nice 
symmetry with the Biichi case, where a structure is Biichi recognizable if and 
only if it is MSG definable in {uj;S). 

Corollary 8. A structure is word automatic if and only if it is WMSO definable 
in the successor structure {u);S). 

Proof. By Theorem 9, it suffices to give a WMSO definition of the structure 
({0, 1}*; ^, Left, Right, EqL) in [uj; S). To do so, interpret each v e {0, 1}* by 
the set Rep(v) = {i : v{i) = l}U{|t;| + l}. Then Rep(v) is a finite set and for each 
nonempty finite set X there is a string v such that Rep{v) = X. Moreover, under 
this representation, each of Left, Right, <, EqL is a definable predicate. □ 

We refer the reader to [9], [52] ,[53] for issues related to definability in auto- 
matic structures. 

2 Characterization Results 

2.1 Automata on trees and tree automatic structures 

The two flavours of automata wc have presented so far operate on linear inputs: 
finite and infinite strings. We now take a slight detour and consider labelled trees 
as inputs for automata. Our archetypal tree is the two successor structure, 

T= {{0,1}*; Left, Right). 

The root of this binary tree is the empty string, denoted as A. Paths in T are 
defined by infinite strings in {0, 1}'^. Let be a finite alphabet. A iJ-labelled 
tree (T, v) associates a mapping v : T ^ S to the binary tree. The set of all 
i7-labelled trees is denoted by Tree{S). 

A Rabin automaton M. is specified hy M — {S,l,A,!F), where S and t 
are the finite set of states and the initial state, the transition relation is Z\ C 
S X S X {S X S), and the accepting condition is given hy C ViS). An input to 
a Rabin automaton is a labelled tree (T, v). A run of ^4 on (T, v) is a mapping 
r :T ^ S which respects the transition relation in that 

r(A) =i, Vx e T[{r{x),v{x),r{Left{x)),r{Right{x))) e A]. 

The run r is accepting if for every path rj in T, the set 

In(r]) — {s ^ S : s appears on 77 infinitely many times} 

is an element of T. The language of a Rabin automaton A4, denoted as L{A4), 
is the set of all Z'-labellcd trees {T,v) accepted by Ai. 

Example 19. Here are a few examples of Rabin automata recognizable sets of 
{0, l}-labelled trees. 



— {(T, v) : v{x) = 1 for only finitely many x G T} 

— {(T, v) : each path has infinitely many nodes labelled 1} 

— {(T, w) : Vx G T{v{x) — 1 =^ the subtree rooted at x is labelled by Os only)} 

— {{T,v) : 3x e T{v{x) ^ 1)}. 

As you may recall from Subsection 1.3, fundamental facts about Biichi au- 
tomata gave us algorithms for checking emptiness and for constructing new au- 
tomata from old ones. Rabin's breakthrough theorems in [49] yield analogous 
results for Rabin automata. 

Theorem 10 (Rabin; 1969). 

1. There is an algorithm that, given a Rabin automaton M., decides if L{M.) 
is empty. 

2. The class of all Rabin recognizable tree languages is effectively closed under 
the operations of union, intersection, and complementation. 

In the setting of sequential automata, Biichi's theorem (Theorem 2) con- 
nected automata and logic. In particular, the logic used was MSO, where we 
allow quantification both over elements of the domain and over subsets of the 
domain. MSO on the binary tree T can be used to express properties of sets 
of trees such as Path{X),Open{X),Clopen{X). Rabin's theorem [49] connects 
MSO definability and automaton recognizability. Note that, as in the Biichi case, 
convolutions can be used to define Rabin recognizability of relations on trees. 

Theorem 11 (Rabin; 1969). A relation R C P(T)*' is definable in the MSO 
logic of the two successor structure if and only if R is recognizable by a Rabin 
automaton. In particular, the MSO theory of T, denoted as S2S, is decidable. 

This theorem has led to numerous applications in logic and theoretical com- 
puter science. Many of these applications involve proving the decidability of a 

particular theory by reducing it to the MSO theory of the binary tree. 

Rabin automata have natural counterparts which work on finite binary trees. 
Let L{X) denote the set of leaves (terminal nodes) of a finite binary tree X. A 
finite E-tiee is a pair {X, v) where X is a finite binary tree and v : X\L(X) S 
is a mapping which labels non-leaf nodes in X with elements of the alphabet. 
A (top-down) tree automaton is M. = {S, l, A, F) where S, i, F are as in the 
word automatic case, and A C S x E x {S x S) is the transition relation. A run 
of on a finite Z'-tree {X, v) is a map r : X S which satisfies r(A) = i and 
{r{x), v{x), r{Left{x)), r{Right{x))) G A. The run is accepting if each leaf node 
X G L{x) is associated to an accepting state r{x) G F. The language of a tree 
automaton is the set of finite Z'-trees it accepts. As before, tree automata have 
pleasant algorithmic properties (see the discussions in [14], [59], [49]). 

Theorem 12 (Doner; 1965. Thatcher, Wright; 1965. Rabin; 1969). 



1. There is an algorithm that, given a tree automaton A4, decides if L{M) is 
empty. 



2. The class of all tree automata recognizable languages is effectively closed 
under the operations of union, intersection, and complementation. 

Automata which work on trees can be used to define tree automatic struc- 
tures. In this context, domain elements of structures are represented as trees 

rather than strings. 

Definition 5. A structure is tree automatic if its domain and basic relations 
are recognizable by tree automata. A structure is Rabin automatic if its domain 
and basic relations are recognizable by Rabin automata. 

Every tree automatic structure is Rabin automatic (we can pad finite trees 
into infinite ones). Since strings embed into trees, it is easy to see that every 
word automatic structure is tree automatic and that every Biichi automatic 
structure is Rabin automatic. However, this inclusion is strict. For example, the 
natural numbers under multiplication (lu; x) is a tree automatic stnieture but 
(as we will see in the next subsection) it is not word automatic. Similarly, the 
countable atomless Boolean algebra is tree automatic but not word automatic 
[33]. In Lecture 3, we will discuss a recent result which separates Biichi and 
Rabin structures. Although there is a strict separation between the classes of 
sequential-input and branching-input automatic structures, their behaviour in 
terms of definability is very similar. 

Theorem 13 (Rabin; 1969). A structure is Rabin automatic if and only if it 

is MSO definable in the binary tree T. 

Theorem 14 (Rabin; 1969). A structure is tree automatic if and only if it is 

WMSO definable in the binary tree T. 

Similarly, we have a Lowenheim-Skolem Theorem for Rabin automatic struc- 
tures akin to Theorem 4 for Biichi automatic structures. Recall that a U-labelled 
tree is called regular if it has only finitely many isomorphic sub-trees. 

Theorem 15 (Hjorth, Khoussainov, Montalban, Nies; 2008). Let A be 

a Rabin automatic structure and consider the substructure A! whose domain is 

a! = {a & A: a is a regular tree}. 

Then A' is a computable elementary substructure of A. Moreover, there is an 

algorithm that from a first-order formula (p{a,x) with parameters a, from A' 
produces a Rabin automaton which accepts exactly those regular tree tuples x 
which make the formula true in the structure. 

2.2 Proving non-automaticity 

Thus far, our toolbox contains several ways to prove that a given structure is 
automatic (explicitly exhibiting the automata, using extended first-order defini- 
tions, and using interpretations in the complete automatic structures). However, 



the only proof we've seen so far of non-automaticity is restricted to word au- 
tomata and uses cardinality considerations (cf. Subsections 1.3, 1.5). We could 
also use general properties of automatic structures to prove that a given struc- 
ture is not automatic; for example, if a structure has undecidable first-order 
theory then it is not automatic by Corollary 2. We will now see a more careful 
approach to proving non-automaticity. 

In finite automata theory, the Pumping Lemma is a basic tool for showing 
non-regularity. Recall that the lemma says that if a set L is regular then there is 
some n so that for every w £ L with |w| > n, there are strings x, u, v with \u\ > 
such that w = xuv and for all rn, xu"^v € L. The constant n is the number of 
states of the automaton recognizing L. The following Constant Growth Lemma 
uses the Pumping Lemma to arrive at an analogue for automatic structures [29] . 

Lemma 1 (Khoussainov, Nerode; 1995). /// : D" ^ D is a function whose 
graph is a regular relation, there is a constant C ( which is the number of states 
of the automaton recognizing the graph of f) such that for all xi, . . . ,Xn & D 

\f{xi, . . .,Xn)\ < max{|a;i|, . . . , |a;„|} + C. 

Proof. Suppose for a contradiction that . . . ,Xn)\ — max{|a;i|, . . . , \xn\} > 

C. Therefore, the convolution of the tuple {xi, . . . , Xn, f{xi, . . . , Xn)) contains 
more than C o's appended to each Xi. In particular, some state in the au- 
tomaton is visited more than once after all the x^'s have been read. As in the 
Pumping Lemma, we can use this to obtain infinitely many tuples of the form 
(xi, . . . , Xn, y) with y ^ /(a^i, . . . a;„) accepted by the automaton, or it must be 
the case that the automaton accepts strings which do not represent convolutions 
of tuples. Both of these cases contradict our assumption that the language of 
the automaton is the graph of the function /. □ 

The Constant Growth Lemma can be applied in the settings of automatic 

monoids and automatic structures in general to give conditions on automaticity 
[33]. Recall that a monoid is a structure (M; •) whose binary operation • is 
associative. 

Lemma 2 (Khoussainov, Nies, Rubin, Stephan; 2004). // (M; •) is an 

automatic monoid, there is a constant C ( the number of states in the automaton 
recognizing ■ ) such that for every n and every si, . . . ,Sn & M 

\si-S2 s„| < max{|si|, |s2|, • • • , + C ■ riog(n)]. 

Proof. Let C be the number of the states in the automaton recognizing the 
graph of the monoid multiplication. We proceed by induction on n. In the base 
case, the inequality is trivial: |si| < j.sij. For n > 1, write n = u + v such that 

u = L^J . Note that u < n and v < n. Let xi = Si s„ and X2 = Su+i s„. 

By the induction hypothesis, |a;i| < max{|si|, . . . , |s„|} -|- (7 • [log(u)] and \x2\ < 



niax{|s„+i|, . . . , |s„|} + C ■ \log{v)~\. Applying Lemma 1, 

\si Sn\ = \xi • a;2| < max{|a;i|, |a;2|} + C 

< max{|si|, . . . , \sn\} + Cmax{[log(u)], [log(u)]} + C 

< max{|si|,...,|s„|} + Criog(n)]. 

□ 

Let A = {A: Fq, Fi, . . . , F„) be an automatic structure. Let X — {xi, X2, ■ ■ ■} 
be a subset of A. The generations of X are the elements of A which can be 
obtained from X by repeated applications of the functions of A. More precisely, 

Gi{X) = {xi} and G„+i(X) = U {Fi(o) : a G G'„(X)} U {a;„+i}. 

The Constant Growth Lemma dictates the rate at which generations of X grow 
and yields the following theorem. 

Theorem 16 (Khoussainov, Nerode; 1995. Blumensath; 1999). Suppose 
X C A and there is a constant Ci so that in the length lexicographic listing of 
X (xi <iiex X2 <iiex ■ ■ ■ ) ws- httve \xn\ < Ci ■ u fov all u > 1. Then there is a 
constant C such that \y\ < C - n for all y G G„(X). In particular, 

Gn{X) C r^c-.n 
if \E\ > I, and <C-nif\S\ = l. 

Just as the Pumping Lemma allows immediate identification of certain non- 
regular sets, the above theorem lets us determine that certain structures are not 

automatic. 

Corollary 9. The free semigroup ({0, 1}*; •) is not word automatic. Similarly, 
the free group F{n) with n > 1 generators is not word automatic. 

Proof. We give the proof for the free semigroup with two generators. Consider 
X = {0,1}. By induction, we sec that for each n, {0,1}^^" C Gn+idO, 1}). 
Therefore, |G„+i(X)| > 2^"~^ - 1 and hence can't be bounded by 2*-^'" for any 
constant C. □ 

Similarly, one can prove the following. 

Corollary 10. For any bijection f : ojxu} ^ u), the structure (w; /) is not word 
automatic. 

The proofs of the next two corollaries require a little more work but employ 
growth arguments as above. 

Corollary 11. The natural numbers under multiplication {ui; x) is not word 

automatic. 

Corollary 12. The structure (w; <, {n\ : n G tu}), where the added unary pred- 
icate picks out the factorials, is not word automatic. 



We now switch our focus to subclasses of (word) automatic structures. For 
some of these classes, structure theorems have been proved which lead to good 
decision methods for questions like the isomorphism problem. Such structure 
theorems must classify both the members of a class and the non-members. Hence, 
techniques for proving non-automaticity become very useful. In other cases, as 
we will see in the next lecture, complexity results give evidence that no nice 
structure theorems exists. The classes we consider below are partial orders, linear 
orders, trees. Boolean algebras, and finitely generated groups. 

2.3 Word automatic partial orders 

The structure (A: <) is a partially ordered set if the binary relation < is 
reflexive Va;(a; < x), anti-symmetric \lxiy{x < y ^ y < x ^ x = y), and 
transitive \/x\/y\/z{x <yk,y<z^x<z). A. partially ordered set {A; <) is 
word automatic if and only if A and < are both recognized by word automata. 
For the rest of this section, we deal only with word automatic partial orders and 
for brevity, we simply call them automatic. 

Example 20. We have already seen some examples of automatic partial orders: 
the full binary tree under prefix order ({0, 1}*; ^); the finite and co-finite subsets 
of w under subset inclusion; the linear order of the rational numbers. 

Recall that a linear order is one where < is also total: for any x, y in the 

domain, either x < y or y < x. 

Example 21. Small ordinals (such as lj" for n finite) [29] are automatic partial 
orders. In fact, Delhomme showed that automatic ordinals are exactly all those 
ordinals below w'^ [13] . We will see the proof of this fact later in this subsection. 

Example 22. The following example was one of the first examples of a non-trivial 

automatic linear order [52] . Given an automatic linear order L and a polynomial 
f{x) with positive integer coefficients, consider the linear order 

S^^^{L + f{x)) 

where we have a copy of L, followed by a finite linear order of length /(O), 
followed by another copy of L, followed by a finite linear order of length /(I), 
etc. This linear order is automatic. In fact, the linear order obtained by the same 
procedure where the function / is an exponential f{x) = a^'^+'^ with a,b,c G u) 
is also automatic. 

The last example involves the addition of linear orders. The sum of orders 
Li and L2 is the linear order in which we lay down the order Li and then we 
place all of L^. Since Li + L2 is first-order definable from the disjoint union 
of Li and L2, Proposition 1 and Corollary 3 imply that the sum operation 
preserves automaticity. Another basic operation on linear orders also preserves 
automaticity: the product linear order Li ■ L2 is one where a copy of Li is 



associated with each element of Z/2; each of the copies of Li is ordered as in Li, 
while the order of the copies is determined by £2- The order Li ■ L2 is also first- 
order definable from the disjoint union of Li and L2, and hence is automatic if 
Li and L2 are automatic. 

We use several approaches to study the class of automatic partial orders. 
First, we restrict to well-founded partial orders and consider their ordinal heights. 
We will see that automatic partial orders are exactly those partial orders with 
relatively low ordinal heights. This observation parallels Delhomme's previously 
mentioned result that automatic ordinals are exactly those below a;" [13]. Next, 
we study automatic linear orders. We present results about the Cantor-Bendixson 
ranks of automatic linear orders, and see the implications of these results for 
decidability questions. In particular, wc sec that the isomorphism problem for 
automatic ordinals is decidable. Finally, we consider partial orders as trees and 
consider the branching complexity of automatic partial order trees. We present 
several automatic versions of Konig's famous lemma about infinite trees. 

We now introduce well-founded partial orders and ordinal heights. A bi- 
nary relation R is called well-founded if there is no infinite chain of elements 
Xo, xi, X2, ■ ■ ■ such that (x^+i, Xi) G R for all i. For example, (Z+; S) is a well- 
founded relation but (Z~ ; S) is not well-founded (we use Z"*" and Z~ to de- 
note the positive and negative natural numbers). Given a well-founded structure 
A = {A; R) with domain A and binary relation R, a ranking function for A is 
an ordinal-valued function f on A such that f{y) < f{x) whenever {y,x) € R. 
We define ord{f) as the least ordinal larger than or equal to all values of /. It is 
not hard to see that A = {A; R) is well-founded if and only if A has a ranking 
function. 

Given a well-founded structure A, its ordinal height (denoted r{A)) is the 
least ordinal q: whic;h is ordig) for some ranking function g for A. An equivalent 
definition of the ordinal height uses an assignment of rank to each element in 
the domain of A. If x is an i?-minimal element of A, set rj\^{x) = 0. For any 
other element in A, put ^^(z) — sup{r(j/) -|- 1 : {y,z) G R}. Then, we define 
r{A) = sup{r^(a;) : x & A}. The following property of ordinal heights is useful 
when we work with substructures of well-founded relations. 

Lemma 3. Given a well-founded structure A = {A, R), if r{A) = a and (3 < a 
then there is x £ A such that r^(a;) = /?. 

The ordinal height can be used to measure the depth of a structure. In 
our exploration of automatic structures, we study the ordinal heights attained 
by automatic well-founded relations. As a point of departure, recall that any 
automatic structure is also a computable structure (see Subsection 1.2). We 
therefore begin by considering the ordinal heights of computable structures. An 
ordinal is called computable if it is the order-type of some computable well- 
ordering of the natural numbers. 

Lemma 4. Each computable ordinal is the ordinal height of some computable 
well-founded relation. Conversely, the ordinal height of each computable well- 
founded relation is a computable ordinal. 



Since any automatic structure is a computable structure, Lemma 4 gives us 
an upper bound on the ordinal heights of automatic well-founded relations. We 
now ask whether this upper bound is sharp. We will consider this question both 
in the setting of all automatic well-founded relations (in Lecture 3), and in the 
setting of automatic well-founded partial orders (now). The following theorem 
characterizes automatic well-founded partial orders in terms of their ordinal 
heights. 

Theorem 17 (Khoussainov, Minnes; 2007). An ordinal a is the ordinal 
height of an automatic well-founded partial order if and only if a < oj'^. 

One direction of the proof of the characterization theorem is easy: each ordi- 
nal below Lu'^ is aiitomatic (Example 21) and is an automatic well-founded total 
order. Moreover, the ordinal height of an ordinal is itself. 

For the converse, we will use a property of the natural sum of ordinals. The 
natural sum of a and f3, denoted a +' /?, is defined recursively a,s a +' = a, 
+' (3 = (3, and a +' P is the least ordinal strictly greater than 7 -|-' /3 for all 
7 < a and strictly greater than a +' j for all 7 < An equivalent definition 
of the natural sum uses the Cantor normal form of ordinals. Recall that any 
ordinal can be written in this normal form as 

a = w^^ni + w^=^n2 H h ui'^''nk, 

where Pi > > ■ ■ ■ > Pk and fc, ni, . . . , € N. We define 

{w^^ ai + • • • + 0;^*= afe) +' {lj^^ 61 + • • • + 0;'^'= 6fe ) = w'^^ (ai + 61 ) + • • • + 0;'^'= (afc + 6fe) . 

The following lemma gives sub-additivity of ordinal heights of substructures with 

respect to the natural sum of ordinals. 

Lemma 5. Suppose A = {A; <) is a well-founded partial order and Ai,A2 form 
a partition of A (A\ U A2 = A, a disjoint union). Let Ai = {Ai; <i),A2 = 
{Ai; <2) be obtained by restricting < to Ai,A2. Then r{A) < r{Ai) +' r{A2)- 

Proof. For each x € A, consider the sets Ai^^ = {z E Ai : z < x} and 
A2,x = {z G A2 : z < x}. The structures Ai^x,A2,x are substructures of Ai,A2 
respectively. Define a ranking function of A by f{x) = r{Ai^x) +' r{A2,x)- The 
range of / is contained in r{Ai) +' r{A2). Therefore, r{A) < r{Ai) +'r{A2). □ 

We now outline the proof of the non-trivial direction of the characteriza- 
tion theorem of automatic well-founded partial orders. Note that this proof fol- 
lows Delhomme's proof that ordinals larger than w'^ are not automatic [13]. 
We assume for a contradiction that there is an automatic well-founded partial 
order A = iA;<) such that r{A) = a > w". Let Ma = ('S'a, -4^, Fa), 
M< = {S<,i<,A<,F<) be the word automata which recognize A and < (re- 
spectively). For each u £ A, the set of predecessors of u, denoted by u |, can be 
partitioned into finitely many disjoint pieces as 

u i= {x € A: \x\ <\u\ k X <u} U \_\ 

v^U* :\v\ — \u\ 



where X" — {vw € A : vw < u} (extensions of v which are predecessors of u). 
Since r{A) > w", Lemma 3 guarantees that for each n, there is an element w„ € A 
with r^(w) = w". Moreover, Lemma 5 imphes that if a structure has ordinal 
height w", any finite partition of the structure contains a set of ordinal height w". 
In particular, for each u„ there iw (,'„ such that j;;,, | = |u„| and r(X"^) = r(?i„) |= 
w". We now use the automata A4a,.M< to define an equivalence relation of 
finite index on pairs {u, v): {u, v) ~ (u', v') if and only if zi^(i^, v) = Aa{la,v') 
and A<{l<, (J^)) = A<{i.<, (^/))- Suppose that {u,v) ~ {u',v'). Then the map 
/ : X^, defined as f{vw) = v'w is an order-isomorphism. Hence, r{X^) = 

r{X^, ). Also, there are at most \Sa\ x \S< \ ^equivalence classes. Therefore, the 
sequence contains some m,n {rn / n) such that {um,Vm) ^ (w„,w„). 

But, Lj"' = r{u,„) = /-(X;;;;') = r(X;'-) = r(?i„) = w". This is a contradiction 
with m ^ n. Thus, there is no automatic well-founded partial order whose ordinal 
height is greater than or equal to w'^. □ 

The above characterization theorem applies to automatic well-founded partial 
orders. We now examine a different class of automatic partial orders: linear 
orders. We seek a similar characterization theorem for automatic linear orders 
based on an alternate measure of complexity. Let (L; <) be a linear order. Then 
x,y G L are called = i?-equivalent if there are only finitely many elements between 
them. We can use =f equivalence to measure how far a linear order is from 
"nice" linear orders like {u; <) or (Q; <). To do so, given a linear order, we take its 
quotient with respect to the =f equivalence classes as many times as needed to 
reach a fixed-point. The first ordinal at which the fixed-point is reached is called 
the Cantor-Bendixson rank of the linear order, denoted CB{L; <). Observe 
that CB{Q; <) = and CB{uj-, <) = 1. Moreover, the fixed-point reached after 
iteratively taking quotients of any linear order will either be isomorphic to the 
rational numbers or the linear order with a single element. A useful lemma tells 
us that automaticity is preserved as we take quotients by =p. 

Lemma 6. If C = {L;<) is an automatic linear order then so is the quotient 
linear order C/ =f- 



Proof. By Proposition 2, it suffices to show that =i? is definable in the extended 
logic {FO 3°°) of (L; <). The definition is 



Theorem 17 showed that well-founded automatic partial orders have rela- 
tively low ordinal heights. In this vein, it is reasonable to expect a low bound 
on the Cantor-Bendixson rank of automatic linear orders as well. The following 

characterization theorem does just that. This characterization theorem and its 
implications for linear orders and ordinals arc discussed in [31], [32]. 

Theorem 18 (Khoussainov, Rubin, Stephan; 2003). An ordinal a is the 
Cantor-Bendixson rank of an automatic linear order if and only if it is finite. 




□ 



The proof of Theorem 18 has many common features with the proof of Theo- 
rem 17. One direction is easy: for n < ui, uj" is automatic and CB(a;") = n. The 
hard direction rehes on understanding the Cantor-Bendixson ranks of suborders 
of a given hnear order. In particular, we make use of suborders determined by 
intervals of a linear order: sets of the form {2; : a; < ^ < y} for some x and y. 

Lemma 7. For any linear order C and ordinal a, if CB{C) = a and f3 < a 
then there is an interval [x,y] of C with CB{[x,y]) = (5. 

To prove that any linear order with infinite Cantor-Bcndixson rank is not 
automatic, we go by contradiction. We suppose that such a linear order exists, 
and use the associated automata to define an equivalence relation on intervals 
which has only finitely many equivalence classes. Moreover, intervals in the same 
equivalence class have the same Cantor-Bendixson rank. However, since we as- 
sume that the linear order has infinite Cantor-Bendixson rank. Lemma 7 allows 
us to pick out intervals with every finite Cantor-Bendixson rank. Therefore, two 
such intervals must be in the same equivalence class. But this contradicts our 
choice of intervals with distinct Cantor-Bendixson ranks. □ 

Theorem 18 has been incredibly productive for decidability results. In par- 
ticular, it yields algorithms for computing the Cantor-Bendixson rank of a given 
automatic linear order, and for studying scattered linear orders and ordinals. 
A linear order is called dense if for each x and y with x < y, there is some z 
such that X < z < y. The linear orders with zero elements or one element are 
trivially dense. For countable linear orders, there are exactly four isomorphism 
types of non-trivial dense linear orders (the rational numbers restricted to (0, 1), 
the rational numbers restricted to [0, 1), the rational numbers restricted to (0, 1], 
the rational numbers restricted to [0, 1]). Note that being dense is a first-order 
definable property, so Corollary 2 tells us we can decide if a given automatic lin- 
ear order is dense. A linear order is called scattered if it contains no non-trivial 
dense sub-order. A linear order is an ordinal if it is well-founded. 

Corollary 13. There is an algorithm which, given an automatic linear order C, 
computes the Cantor-Bendixson rank of C. 

Proof. Check if C is dense. If it is, output CB{C) = 0. Otherwise, Lemma 

6 tells us that the quotient C/ =f is automatic. We iterate checking if the 
quotient is dense and, if it is not, constructing the next quotient and incrementing 
the counter. Each of these steps is effective because denseness is a first-order 

question. Moreover, this procedure eventually stops by Theorem 18. Once we 
reach a dense quotient structure, we output the value of the counter. □ 

The following two corollaries about automatic scattered linear orders use 
trivial modifications of the above algorithm. 

Corollary 14. It is decidable if a given autom,atic linear order is scattered. 

CorollEiry 15. Given an automatic linear order £ that is not scattered, there is 
an algorithm which computes an automatic dense suborder of L. 



We now apply Theorem 18 to the subclass of linear orders which are well- 
founded, the ordinals. We will see that we can effectively check if a given au- 
tomatic linear order is an ordinal; and given two automatic ordinals, we can 
check if they are isomorphic. The isomorphism question is one of the central 
motivating questions in the study of automatic structures (see Subsection 1.1). 
The class of automatic ordinals was one of the first contexts in which a positive 
answer to this question was found. 

Corollary 16. // C is an automatic linear order, there is an algorithm which 
checks if jC is an ordinal. 

Proof. To check if a given automatic linear order C is an ordinal, we need to check 

if it has an infinite descending sequence. Note that infinite descending sequences 
can occur either within an =i7' equivalence class or across such classes. We begin 
by checking whether £ is not dense and V(a; € L)3°°y{x =f y ^ V < x). If 
this condition holds, we form the quotient C/ ~f and check the condition again 
for the quotient linear order. We iterate until the condition fails, which must 
occur after finitely many iterations because CB{C) is finite. If the resulting 
linear order has exactly one element, output that C is an ordinal, and otherwise 
output that it is not. If C is an ordinal then the =p equivalence classes are all 
finite or isomorphic to u) and all quotient linear orders of C are also ordinals. 
Therefore, the algorithm will stop exactly when the quotient is dense, in which 
case it will have exactly one element. If C is not an ordinal, then either there 
will be a stage of the algorithm at which there is an infinite descending chain 
within a single =f equivalence class or the final dense linear order will contain 
an infinite descending chain. In either case, the algorithm will recognize that C 
is not an ordinal. □ 

Corollary 17. The Cantor normal form of a given automatic ordinal is com- 
putable. 

Proof. Given an automatic ordinal C, we use first-order definitions of maximal 
elements and the set of limit ordinals in C to iteratively determine the coefficients 
in the Cantor normal form. The set of limit ordinals play a role because if 
a = Lu™am + • • • + wai, then the Cantor normal form of the set of limit ordinals 
strictly below a is uj^^^^am + • • • + oj^a2 + ai. □ 

Since two ordinals are isomorphic if and only if they have the same Cantor 
normal form, the following corollary is immediate. 

Corollary 18. The isomorphism problem for automatic ordinals is decidable. 

We do not know whether the isomorphism problem for automatic linear or- 
ders is decidable. 

2.4 Word automatic trees 

We now focus our interest on partial orders which form trees. We begin with the 
following definition of trees. 



Definition 6. A (partial order) tree is A= {A; <) where < is a partial order 
such that A has a <-least element and the set x i (the <-predecessors of x) is 
linearly ordered and finite for all x € A. 

For any regular language L, if L is prefix-closed then the structure (L; ^) 
where ^ is the prefix relation is a (word) automatic tree. The two most famous 
such examples are the full binary tree ({0, 1}*; ^) and the countably branching 
tree cj^" = ({0.1}* • 1;^). In the following, for a given tree T = (T; <) we 
denote by 7^ the subtree of T which is rooted at node x; that is, %c is the tree 
i{yeT:x<y};<). 

There is a natural connection between trees and topological spaces: given 
a tree T = (T;<), define an associated topological space whose elements are 
infinite paths of T and whose basic open sets are collections of infinite paths 
defined by {P : x G P} for each x G T. Then the topological Cantor-Bendixson 
rank transfers to trees in a straightforward way. As in the case of linear orders, we 
can compute this rank via an iterative search for a fixed-point. The derivative 
is defined to be d{T), the subtree of T containing all nodes x G T such that 
X belongs to two distinct infinite paths of T. The derivative can be carried 
along the ordinals by letting rf"+^(T) = d{d°'{T)) and, for limit ordinals 7, 
^7(7") = n/5<^.rf^(T). The first ordinal a for which (i"(r) = d"+i(T) is called 
the Cantor-Bendixson rank of T and is denoted by CB{T). The proof of the 
following lemma gives some intuition about Cantor-Bendixson ranks and is left 
to the reader. 

Lemma 8. // a tree has only countably many paths, its Cantor-Bendixson rank 
is or a successor ordinal. 

As in our discussion of linear orders, it is natural to look for low bounds on 
the complexity of automatic trees. In particular, the Cantor-Bendixson ranks of 
trees and linear orders are intimately connected. A linear order can be associated 
with each automatic finitely branching tree. The Kleene-Brouwer ordering 
of the tree T = (T; <) is given by x <kBt U if aiid only if a; = y or y < a; or 
there are u,v,w gT such that v, w are immediate successors of u and v <uex w 
and V < X and w < y. Note that the definition of the Kleene-Brouwer order uses 
the length-lexicographic ordering inherited by T as a subset of S* for some finite 
alphabet S. It is not hard to see that the Kleene-Brouwer order is a linear order. 
For example, the Kleene-Brouwer order associated to the tree which has exactly 
two infinite paths (0* U 1*; ^) is isomorphic to u* + u*. li T is an automatic 
tree then the relation <kbt is first-order definable in it. Hence, the results in 
Subsection 2.3 imply that the Kleene-Brouwer ordering of an automatic finitely 
branching tree is automatic. 

Lemma 9 (Khoussainov, Rubin, Stepiian; 2003). If T is an automatic 
finitely branching tree with countably many paths then CB{T) = CB{KBx)- 

Proof. The outline of the proof is as follows; for details, see [31]. We claim that 
if T is an automatic finitely branching tree with countably many paths then 



KBr is scattered. Given any infinite path of T, the linear order KBq- can be 
expressed as a countable sum of linear orders which are either trivial or KBs 
for S a subtree of T. By induction, we will show that CB{T) = CB{KBt)- If 
CB{T) = then it is empty so KBr is also empty and CB{KBr) = 0. For the 
inductive step, we suppose that CB{T) ~ (3 + 1 and notice that the subtree of 
T whose domain is X = {x €: T : CB{%) — (3 + 1} has a finite and nonzero 
number of infinite paths. Each of these paths give rise to a sum expression of 
KB-j-^, where CB{KBtS) = /3+ 1- Then KBq- is the sum of these finitely many 
linear orders KBq-^ and so KBr = /3 + 1 as well. □ 

The lemma above allows us to transfer the bound on Cantor-Bendixson ranks 
of automatic linear orders to a bound on the Cantor-Bendixson ranks of auto- 
matic finitely branching trees with countably many paths. 

Theorem 19 (Khoussainov, Rubin, Stephan; 2003). IJT is an automatic 

finitely branching tree with countably many paths then CB{T) is finite. 

Proof. Suppose T is as above. Then KBq- is also automatic. By Theorem 18, 
CB{KBr) is finite. Lemma 9 then gives that CB{T) is finite. □ 

We will improve Theorem 19 by weakening its hypotheses. Our goal is to 
show that the Cantor-Bendixson rank of each automatic partial order tree is 
finite. 

Theorem 20 (Khoussainov, Rubin, Stephan; 2003). IfT is an automatic 

finitely branching tree then CB{T) is finite. 

Proof. We consider two kinds of nodes on T: those which lie on at most countably 
many infinite paths, and those which lie on uncountably many infinite paths. 
For each x which lies on at most countably many infinite paths consider 7^ , the 
subtree of T rooted at x. Note that KBj-^ is an interval of KBq- for each x. By 
Lemma 9 and Theorem 18, CB{%) = CB{KBrJ < CB{KBr) = n for some 
n. Therefore, (i"(T) contains only nodes which lie on uncountably many infinite 
paths. In particular, this implies that d"+i(T) = d"(T) so CB{T) = n. □ 

Finally, we remove the requirement that the tree is finitely branching. 

Theorem 21 (Khoussainov, Rubin, Stephan; 2003). IfT is an automatic 

tree then CB{T) is finite. 

Proof. Given an automatic tree T = (T; <), we consider a finitely branching 
tree which is first-order definable in T and whose Cantor-Bendixson rank is no 
less than that of T. The tree T' has domain T and partial order <' defined by 

X <' y "^=^ X <yy 3v3w[x e S{v) k, w £ S{v) & x <uex w k, w <y), 

where S is the immediate successor function with respect to <. To picture T', 
note that each node has at most two immediate successors: its <(;ex-least suc- 
cessor from T, and its <Hea;-least sibling in T. We can define a continuous 
and injective map from the infinite paths of T to the infinite paths of T'. 
Analysing the derivatives of T and T' via this map leads to the conclusion 
that CB{T) < CB{T'). Applying Theorem 20 to T', CB{T) is finite. □ 



Corollary 19. An ordinal a is the Cantor-Bendixson rank of an automatic 
(partial order) tree if and only if it is finite. 

One of the most fundamental tools available for the study of finitely branch- 
ing trees is Konig's lemma: every finitely branching infinite tree has an infinite 
path. It is natural to wonder whether such a result holds in the context of au- 
tomatic structures. There are several ways one can transfer results from mathe- 
matics to include feasibility constraints. For example, we can ask whether every 
automatic finitely branching infinite tree has a regular infinite path. A stronger 
result would be to find a regular relation which picks out the infinite regular 
paths through automatic trees. We will now develop several automatic versions 
of Konig's lemma in this spirit. As a starting point, recall Corollary 6: every 
infinite automatic finitely branching tree has a regular infinite path. A surpris- 
ing feature of the landscape of automatic structures is that a version of Konig's 
lemma holds even when we remove the finitely branching assumption. 

Theorem 22 (Khoussainov, Rubin, Stephan; 2003). // an infinite auto- 
matic tree has an infinite path, it has a regular infinite path. 

Proof. The first-order definition of an infinite path in Corollary 6 relied on a 
definition of the set of nodes which lie on any infinite path. However, in the 
context of trees which may have nodes with infinite degree, such a definition is 
harder to find. To show that this set of nodes is regular, we define an auxiliary 
Biichi recognizable set which tracks nodes on infinite paths. Our desired regular 
set is then achieved via applications of the projection operation and decoding of 
the Biichi automaton. For more details, see [31]. □ 

A full automatic version of Konig's lemma would produce a regular relation 
which codes all infinite paths through a given automatic tree. The following 
theorem from [31] does just this. 

Theorem 23 (Khoussainov, Rubin, Stephan; 2003). Given an automatic 
tree T , there is a regular relation R{x, y, z) satisfying the following properties: 

— 3y3z (^R{x,y,z) <J=> Tx has at most countably many infinite paths); 

— for each y G S* and for each x such that 7^ has at most countably many 
infinite paths, the set Rx,y = {z : R{x, y, z)} is either empty or is an infinite 
path through %; and 

— for each x such that has at most countably many infinite paths, if a is an 
infinite path through 7^ then there is ay & E* such that Rx,y = ct. 

In this section and in the preceding one, we gave tight bounds on the ranks 
and heights of automatic well-founded partial orders, trees, ordinals, and partial 
order trees. Such bounds indicate that the level of complexity of the structures in 
these classes is relatively low. However, it does not give us concrete information 
on the members of these classes. In the next section, we present classification 
theorems for various collections of automatic structures. 



2.5 Word automatic Boolean algebras 

We now use the results about non-automaticity from Subsection 2.2 to give 
a classification of word automatic Boolean algebras. Recall that a Boolean 
algebra is a structure (D; V, A, 0, 1) which satisfies axioms relating the join 
(V), meet (A), and complement (-i) operations and the constants and 1. An 
atom of a Boolean algebra is a minimal non-0 element. An archetypal Boolean 
algebra is B^^, the collection of all finite and co-finite subsets of ui. In this case, 
the Boolean operations are the Boolean set operations: V — U, A — D, ^ = °. 
As we saw in Subsection 1.5, Bu; has a word automata presentation. For each 
n > 1, we define B" to be the n-fold Cartesian product of B^j. Since finite 
Cartesian products preserve automaticity, B^ is automatic. The main theorem 
of this section says that each automatic Boolean algebra must be isomorphic to 
for some n. 

Theorem 24 (Khoussainov, Nies, Rubin, Stephan; 2004). A Boolean al- 
gebra is word automatic if and only if it is isomorphic to Z?" for some n > 1. 

The discussion above shows that any Boolean algebra isomorphic to B^ is 
word automatic. We now prove the converse. Let B be an automatic Boolean 
algebra. Assume for a contradiction that B is not isomorphic to any BJ^. We 
will construct a set of elements of B which will contradict the Growth Lemma 
for Monoids (Lemma 2). To do so, we recall some terminology for elements of 
Boolean algebras. We say that a, b E B arc F-cquivalcnt, a =p bAi the element 
(a A 6) V (a A b) is a union of finitely many atoms of B. Note that B/ =f is 
itself a Boolean algebra. Moreover, B/ =p is automatic because the equivalence 
relation =^ is (FO + 3°°) definable in B (Proposition 2). Moreover, since B is 
not isomorphic to B^ for any n, B/ =f is infinite. We call x € B infinite if 
there are infinitely many elements y G B such that y Ax = y (i.e. y < a; in the 
partial order induced by the Boolean algebra). We say that x splits y in ;B if 
X Ay ^0 and x Ay 0. We call x € B large if the =f equivalence class of x is 
not a finite union of atoms of the quotient algebra. The following lemma collects 
properties of infinite and large elements of a Boolean algebra. 

Lemma 10. // y is large then there is x that splits y such that y A x is large 
and y Ax is infinite. If y is infinite then there is x that splits y such that either 
X Ay is infinite or x Ay is infinite. 

We use Lemma 10 to inductively define a sequence of trees {Tn} whose nodes 
are binary strings. To each node cr of the tree 7^ we associate an element b„ of 
the Boolean algebra. We denote the set of elements of B associated to the leaves 
of the tree 7^ by Xn- At each step of the induction, we verify that the following 
properties hold: 

— There is a leaf u e 7^ such that b„ is large in B. 

— There arc n leaves cti , . . . , (T„ € 7^ such that bc^ is infinite. 

— The tree 7^ has as least "^"2*"^^ many leaves. 

— If C7 and T are distinct leaves of 7^, 6^ A 6r = 0. 



The base tree is Tg = {A} (where A is the empty binary string) and bx = 1. 
Given 7^, we define 7^+i as the cixtcnsion of 7^ obtained by doing the following 
for each leaf a G 7^. If 6o- is large, let a € .B be the length lexicographically first 
element that splits ba such that both b„ Aa and b„ Aa are infinite and one of 
them is large. Such an a exists by Lemma 10. Add ctO and crl to 7^+i (as leaves) 
and let b^^Q = b^ Aa and b^i = b„Aa. If bcr is one of the n leaves from 7^ for 
which ba- is infinite but is not large, let a € B be the length lexicographically 
first clement of B that splits b^ such that one of b^ A a and b^- A a is infinite. 
Again, this a exists by Lemma 10. Add crO and crl to 7^+i and let b^o = b^ Aa 
and bai = b^Aa. For any leaf cr e 7^ that does not fall into either of these cases, 
let cr G 7^+1 also be a leaf. 

The first induction hypothesis holds for Tn+i because any leaf in 7^ associated 
to a large element in B is extended by at least one node associated to a large 
element in B. Likewise, the second hypothesis is preserved because each of the 
n nodes associated to infinite elements arc extended by a node associated to an 
infinite element, plus the node associated to a large element is extended by a 
node associated to an infinite element. The third hypothesis holds because at 
least n+1 leaves of 7^ are replaced by two leaves each, so if has at least "^"^^"'"•^ 
leaves then 7^+i has at least ("+iK"+^) niany leaves. Finally, the last hypothesis 
is preserved because if r G 7^ is a leaf, r A (6^ A a) = (r A 5^) A a = A a = 0. 

Each clause in the inductive definition can be formalized in (FO+3°°). Hence, 
for all n, %i is word automatic. In particular, the fimctions /o,/i : Xn — > Xn 
which map fo{ba) = b^o and fiib^) = b^i have regular graphs. Therefore, the 
Constant Growth Lemma (Lemma 1) implies that there are constants Co, Ci 
such that 

l&^ol < \ba\+Co and < \b,\+C\ 

In particular, there is a constant C2 such that for all x G Xn, \x\ < C2 -n. In other 
words, Xn ^ i7'^2 ". Lemma 2 then gives that the Boolean algebra generated by 
Xn is also a subset of S^^"-^ and therefore has at most 2*^("' elements. On the 
other hand, since there are at least many leaves in 7n and distinct leaves 

yield disjoint elements of Xn, the Boolean algebra generated by Xn has size at 
least 2 2 . This is a contradiction, and proves Theorem 24. □ 

The classification of Boolean algebras in Theorem 24 gives rise to an algo- 
rithm solving the isomorphism problem for automatic Boolean algebras. 

Corollary 20. It is decidable whether two automatic Boolean algebras are iso- 
morphic. 

Proof. By Theorem 24, two automatic Boolean algebras arc isomorphic if and 
only if they are isomorphic to B^ for the same n. Given an automata presenta- 
tions of a Boolean algebra, it satisfies the first-order expressible property "there 
are n disjoint elements each with infinitely many atoms below, and for each 
m > n there aren't m disjoint elements each with infinitely many atoms below 
them" if and only if it is isomorphic to B". To decide if two automatic Boolean 



algebras 61,82 are isomorphic, search for m and n2 such that Bi = B^'^ and 
B2 — B'^'^ and then reply "yes" if and only if m = n2. □ 

We have seen explicit descriptions for which ordinals and which Boolean 
algebras have automata presentations. We have also seen theorems which give 
conditions on invariants (ordinal height, Cantor-Bendixson rank) of structures 
with automatic presentations. In the following subsection, we will continue to 
accumulate results of both kinds. 

2.6 Word automatic finitely generated groups 

We now turn to finitely generated groups which have automata presentations. 
As in the previous subsections, we consider only word automata presentations. 
Recall that a group is a structure (G; •, ~^,e) where the group operation is 
associative and the inverse behaves as expected with respect to the identity. 
In Subsection 1.5, we saw that finitely generated abelian (commutative) groups 
have automata presentations. In Subsection 2.2, we saw that the free group F{n) 
with n > 1 is not automatic. These two examples represent extreme behaviours 
with respect to automata. We now work towards finding the exact boundary 
between automaticity and non-automaticity for groups. 

Recall that a subgroup is a subset of a group which is itself a group; the 
index of a subgroup is the number of left cosets of the subgroup. A good intro- 
duction to basic group theory is [51]. 

Definition 7. A group is virtually abelian if it has an abelian subgroup of 
finite index. Similarly, for any property X of groups, we say that a group is 
virtually X if it has a subgroup of finite index which has property X. 

Definition 8. A group is torsion- free if the only element of finite order (the 
least number of times it must be multiplied by itself to yield the identity) is the 
identity. A subgroup N of G is normal if for all a G G, aN = Na. 

The following basic fact from group theory will play a key part in our anal- 
ysis of automata presentable finitely generated groups. It says that the abelian 
subgroup of a virtually abelian finitely generated group can be assumed to have 
a special form. 

Fact 25 Every virtually abelian finitely generated group has a torsion-free nor- 
mal abelian subgroup of finite index. 

We are now ready to prove that each member of a large class of finitely gen- 
erated groups has an automata presentation. This lemma significantly extends 
Example 15 which dealt with abelian finitely generated groups. 

Lemma 11. Any virtually abelian finitely generated group is automatic. 



Proof. Let G be a virtually abelian finitely generated group. By Fact 25, let 
-/V = {xi, . . . , Xk) be an abelian torsion-free normal subgroup of finite index of 
G. Thus, there are ai, . . . , a„ £ G such that G = LJ"^ja,iV. Since N is normal, 
for any j = 1, . . . , n there are hi, . . . ,hk & N such that 

T '^'1 1(7) 1(7) 7 ^1 k(l) ITT'k 

Xittj = Qjhi = (ijX^ • • - Xj, . . . , Xfeaj = Qjhk = UjXi ' ^ ' • • - x^, 

Moreover, there is a function / : {1, . . . , n} x {1, . . . , n} ^ {1, . . . , n} such that 

a,aj = a/(,j)/i(^j) = af^ij^x^ ---Xk 

For each g E G there la h <E N and i G 1, . . . , n so that g = a^/i = a^x™"^ • • • a;^'° . 
Therefore, we can express the product of two elements of the group as 

(a,xf • • •<) • {ajxf ■■■xf) =«,.(,,),xf • • • 

Expressing group multiplication in this way is well-suited to automata operations 
and leads to an automata presentation of G. □ 

In fact, [46] contains the following theorem which shows that the class of 
virtually abelian finitely generated groups coincides exactly with the class of 
automata presentable finitely generated groups. 

Theorem 26 (Oliver, Thomas; 2005). A finitely generated group is auto- 
matic if and only if the group is virtually abelian. 

To prove Theorem 26, it remains to prove only one direction of the classifi- 
cation. We will use several definitions and facts from group theory to prove this 

direction (cf. [20], [50], [45]). 

Definition 9. The commutator of a group G is the set 

[G,G\ = {[9.h]=g-^h-^gh:g,heG]. 

The powers of a group are defined inductively as G^ = G and G''^^ — [Gfe, Gk]- 
The group G is solvable if there is some n such that = {e}. The maps 7^ 
are defined inductively by 

70(G) =G, 7fe+i(G) = [7(Gfe),G]. 

The group G is nilpotent ifjniG) = {e} for some n. 

Fact 27 IfG is nilpotent then G is solvable. 

The following theorems of group theory relate algorithmic and growth prop- 
erties of a group to the group theoretic notions defined above. These theorems 
are instrumental in proving Theorem 26 since we understand the algorithmic and 
growth properties of groups with automata presentations. Note that a finitely 
generated group is said to have polynomial growth if the size of the n*'* gen- 
eration (G„) of the set of generators of the group is polynomial in n (recall the 
definition of the generations of a structure from Subsection 2.2). 



Theorem 28 (Gromov; 1981). // a finitely generated group has polynomial 
growth then it is virtually nilpotent. 

Theorem 29 (Romanovskii; 1980. Noskov; 1984). A virtually solvable group 
has a decidable first- order theory if and only if it is virtually abelian. 

Proof (Theorem 26). Let G be an automatic finitely generated group. Suppose 
G = {ai, .... flfe). By the generation lemma for monoids (Lemma 2), for each n, 
Gn{{ai_,...,ak}) C i;Ciog(n)^ Therefore, |G'„({ai, . . . , afe})| < , and so G has 
polynomial growth. By Gromov's theorem, G is virtually nilpotent, hence Fact 
27 implies that it is virtually solvable. Since G is automatic, it has a decidable 
first-order theory. Hence, Romanovskii and Noskov's theorem implies that G is 
virtually abelian, as required. □ 

We have just seen a complete description of the finitely generated groups 
which have word automata presentations. In the Boolean algebra case, such a 
description led to an algorithm for the isomorphism question. However, whether 
such an algorithm exists in the current context is still an open question: is the 
isomorphism problem for automatic finitely generated groups decidable? 

Many other questions can be asked about automata presentations for groups. 
For example, we might shift our attention away from finitely generated groups 
and ask whether the isomorphism problem for automatic torsion-free abelian 
groups is decidable. A problem which has been attempted without success by 
many of the most distinguished researchers in automatic structures is to de- 
termine whether the additive group of the rational numbers (Q, +) has a word 
automata presentation. More details about the current state of the art in au- 
tomata presentable groups may be found in the survey paper [44] . 

3 Complicated Structures 

3.1 Scott ranks of word automatic structures 

In the previous lecture we saw positive characterization results and relatively 
low tight bounds on classes of automatic structures. In particular, we saw that 
u) is the tight bound on the Cantor-Bendixson ranks of automatic partial order 
trees, and that is the tight bound on the ordinal heights of automatic well- 
founded relations. We will now prove results at the opposite end of the spectrum: 
results that say that automatic structures can have arbitrarily high complexity in 
some sense. This will have complexity theoretic implications on the isomorphism 
problem for the class of automatic structures. 

We begin by recalling an additional notion of rank for the class of countable 

structures. This Scott rank was introduced in [56] in the context of Scott's 
famous isomorphism theorem for countable structures. 

Definition 10. Given a countable structure A, for tuples a,b € A" we define 
the following equivalence relations. 



— d=^bifd and b satisfy the same quantifier free sentences, 

— for ordinal a > 0, d =" b if for all (3 < a, for each tuple c there is a tuple 
d such that d, c =^ b,d and for each tuple d there is a tuple c such that 

d, c =^ b, d. 

The Scott rank of tuple d is the least (3 such that for all b S A", d =^ b implies 
{A,d) = {A,b). The Scott rank of the structure A, denoted SR{A), is the least 
ordinal greater than the Scott ranks of all tuples of A. 

The Scott rank was extensively studied in the context of computable model 
theory. In particular, Nadel [42] and Harrison [24] showed that the tight upper 
bound for the Scott rank of computable structures is + 1. Recall that ^ 
is the first non-computable ordinal; that is, it is the least ordinal which is not 
isomorphic to a computable well-ordering of the natural numbers. However, most 
common examples of automatic structures have low Scott ranks. The following 
theorem from [35] proves that automatic structures have the same tight upper 
bound on the Scott rank as computable structures. 

Theorem 30 (Khoussainov, Minnes; 2007). For each a < wp^ -|- 1, there 

is an automatic structure with Scott rank at least a. Moreover, there is an auto- 
matic structure with Scott rank . 

Proof. We outline the idea of the proof. The main thrust of the argument is the 
transformation of a given computable structure to an automatic structure which 
has similar Scott rank. Let C = {C; Ri, . . . , Rm) be a computable structure. 
For simplicity in this proof sketch, we assume m = 1 and that C — E* for 
some finite E. Let A4 he a. Turing machine computing the relation R. The 
configuration space Conf{M) of the machine is a graph whose nodes encode 
configurations of A4 and whore there is an edge between two nodes if A4 has 
an instruction which takes it in one step from the configuration represented by 
one node to that of the other (see Example 11). Recall that Conf{M.) is an 
automatic structure. We call a deterministic Turing machine reversible if its 
configuration space is well-founded; that is, it consists only of finite chains or 
chains of type oj. The following lemma from [6] allows us to restrict our attention 
to reversible Turing machines. 

Lemma 12 (Bennett; 1973). Any deterministic Turing machine may he sim- 
ulated by a reversible Turing machine. 

Without loss of generality, we assume that is a reversible Turing machine 
which halts if and only if its output is "yes" . We classify the chains in Conf{M.) 
into three types: terminating computation chains are finite chains whose base 
is a valid initial configuration, non-terminating computation chains arc infinite 
chains whose base is a valid initial configuration, and unproductive chains are 
chains whose base is not a valid initial configuration. We perform the following 
smoothing operations to Conf{A4) in order to capture the isomorphism type of 
C within the new automatic structure. Note that each of these smoothing steps 
preserves automaticity. First, we add infinitely many copies of a; chains and finite 



chains of every finite size. Also, we connect to each base of a computation chain 
a structure which consists of infinitely many chains of each finite length. Finally, 
we connect representations of each tuple x in C to the initial configuration of M. 
given X as input. We call the resulting automatic graph A. The following lemma 
can be proved using the defining equivalence relations of Scott rank and reflects 
the idea that the automatic graph contains witnesses to many properties of the 
computable structure. 

Lemma 13. SR{C) < SR{A) <2 + SR{C). 

At this point, wc have a tight connection between Scott ranks of automatic 
and computable structures. The following lemmas from [19] and [37] describe 
the Scott ranks that are realized by computable structures, 

Lemma 14 (Goncharov, Knight; 2002). For each computable ordinal a < 
uji^ , there is a computable structure whose Scott rank is above a. 

Lemma 15 (Knight, Millsir; in print). There is a computable structure with 
Scott rank ^ . 

We apply Lemma 13 to Lemmas 14 and 15 to produce automatic structures 
with Scott ranks above every computable ordinal and at wp^. To produce an 
automatic structure with Scott rank ^ + 1 , wc apply Lemma 13 to Harrison's 
ordering from [24]. This concludes the proof of Theorem 30. □ 

Corollary 21 (Khoussainov, Rubin, Nies, Stephan; 2004). The isomor- 
phism problem for automatic structures is El-complete. 

Proof. In the proof of Theorem 30, the transformation of computable structures 
to automatic structures preserves isomorphism types. Hence, the isomorphism 
problem for computable structures is reduced to that for automatic structures. 
Since the isomorphism problem for computable structures is 17^-complete, the 
isomorphism problem for automatic structures is i^J-complete as well. □ 

3.2 More high bounds for word automatic structures 

The previous subsection contained an example where the behaviour of automatic 
structures matched that exhibited by the class of computable structures. The 
techniques of Theorem 30 can be used to obtain similar results in a couple of 
other contexts. We first revisit the idea of automatic trees, introduced in Subsec- 
tion 2.4. In that subsection, we saw that all automatic partial order trees have 
finite Cantor-Bendixson rank (Theorem 21). Consider now a different viewpoint 
of trees. 

Definition 11. A successor tree is A= {A;S) where, if <s is the transitive 
closure of S, {A; <s) is a partial order tree. 



Observe that the successor tree associated with a given partial order tree is 
first-order definable in the partial order tree. Hence, Corollary 3 implies that any 
automatic partial order tree is an automatic successor tree. However, the inclu- 
sion is strict: the following theorem from [35] shows that the class of automatic 
successor trees is far richer than the class of automatic partial order trees. 

Theorem 31 (Khoussainov, Minnes; 2007). For each computable ordinal 
a < there is an automatic successor tree of Cantor-Bendixson rank a. 

The proof of Theorem 31 utilizes the configuration spaces of Turing machines, 
as in the proof of Theorem 30. Another setting in which these tools are useful 
is that of well-founded relations (see [35]). In Subsection 2.3, we proved that 
automatic well-founded partial orders have ordinal heights below u^. If we relax 
the requirement that the relation be a partial order, we can attain much higher 
ordinal heights. 

Theorem 32 (Khoussainov, Minnes; 2007). For each com,putable ordinal 
a < oj^^ there is an automatic well-founded relation whose ordinal height is at 
least a. 

Theorem 32 answers a question posed by Moshe Vardi in the context of pro- 
gram termination. Given a program P, we say that the program is terminating 
if every computation of P from an initial state is finite. If there is a computation 
from a state x to y then we say that y is reachable from x. Thus, the program is 
terminating if the collection of all states reachable from the initial state is a well- 
founded set. The connection between well-foundedness and program termination 
is explored further in [7]. 

3.3 Borel structures 

Most of Lectures 2 and 3 so far dealt exclusively with word automatic structures. 
To conclude this lecture, we turn again to automata on infinite strings (Biichi 
automata) and infinite trees (Rabin automata). Recall from Subsection 2.1 that 
word automatic structures arc a strict subset of tree automatic structures. We 
would like a similar separation between Biichi automatic structures and Rabin 
automatic structures. To arrive at such a separation, we recall a complexity 
hierarchy of sets from descriptive set theory (a good reference is [28]). 

Definition 12. A set is called Borel in a given topology if it is a memher of the 
smallest class of sets which contains all open sets and closed sets and is closed 
under countable unions and countable intersections. 

In the context of automatic structures, we have an underlying topology which 

depends on the objects being processed by the automata. If the input objects 
are infinite binary strings (as in the case of Biichi automata) , the basic open sets 
of the topology are defined as 

[a] ={ae {0, lY -.(K a) 

for each finite string cr. Based on this topology, we have the following definition. 



Definition 13. A structure is Borel if its domain and basic relations are Borel 
sets. 



Fact 33 Every Biichi automatic structure is Borel. 

We can use Fact 33 to prove that not all Rabin automatic structures are 
recognizable by Biichi automata. [25] contains the following theorem about Rabin 
automatic structures. 

Theorem 34 (Hjorth, Khoussainov, Montalban, Nies; 2008). There is 
a Rabin automatic structure that is not Borel and, therefore, is not Biichi auto- 
matic. 

Proof. Consider the tree language 



It is easy to see that V is Rabin recognizable. However, it is not Borel: con- 
sider the embedding of into T which takes ni-- - rik to l"i01"20. . . l"*". 
The pre-image of V is the set of trees in w^'^ which have no infinite path. This 
set is an archetypal example of a non-Borel set, and hence V is not Borel. We 
will now transfer this example to the setting of structures by coding V into a 
Rabin automatic structure. The domain of this structure is the class of {0, 1}- 
labelled trees {T,v). The structure has two unary predicates: S = {{T,v) : 
there is a unique x for which v{x) = 1}, and V as above. In addition, the struc- 
ture contains two unary functions Left' . Right/ which mimic the operations on 
T. Then {D; S, V, Left', Right') is Rabin automatic. But, if it had a Borel copy 
then V would be Borel, and so the structure is not Borel. Thus, we have a Rabin 
automatic structure that is not Biichi automatic. □ 

We conclude these lectures by proving Theorem 6 from Subsection 1.4. To 
do so, we need to exhibit a Biichi automatic structure and a Biichi recognizable 
equivalence relation such that their quotient is not Biichi recognizable. By Fact 
33, it suffices to show that the quotient is not Borel. We will use the following 
well-known fact from descriptive set theory (see [26] for a survey of relevant 
results in this area). Recall that for X,Y C 'N, X =* Y means that X and Y 
agree except at finitely many points. 

Fact 35 There is no Borel function F : 'P(N) {0, 1}'^ such that for all X,Y C 
N,X=*Y if and only if F{X) = F{Y) 

Proof (Theorem 6). The structure we will consider is an expansion of the disjoint 
union of two famihar structures. Let B = (^(N); C) and B* = (^(N)/ =*; <). 
We will study the disjoint of union of B and B* along with a unary relation U {x) 
satisfying U{x) <;=^ x G V{H) and a binary relation R{x,y) interpreted as the 
canonical projection of V(N) into 7'(N)/ =*. 

Suppose that Bq = {Bq; <o) and Bi = {Bi; <i) are two disjoint presentations 
of S = (P(N); C). Let ?7° pick out So and R° be the bijection from Bq to Bi 



V = {(T, w) : each path through T has finitely many Is}. 



which acts hke the identity on V{N). Then A={BoU Bi; <o U <i, [7°, is a 
Biichi recognizable structure. We define the equivalence relation 



It is not hard to see that E{x, y) is Biichi recognizable given that Bq and B\ are. 
Observe that A/E ^ (P(N) U P(N)/ =*; <, U, R). 

Assume for a contradiction that A/E has a Biichi automata presentation. By 
Fact 33, this implies that A/E has a Borel presentation C = (C; <'-'', Lf^ , Rp). 
Let ^ : A/ — > C be an isomorphism and G : Bq ^ Che the restriction of # to 
Bq. The following lemma tells us that G is a Borel function. 

Lemma 16. If S = {S;<), S' = (S"; <') are Biichi structures and E,E' are 
Biichi equivalence relations such that S / E,S' / E' = (7'(N); C) then if\P : S/E — > 
S'/E' is an isomorphism, graph{\l/) = {{x,y) €5x5": <P([x]e) = is 
Borel. 

To prove the lemma, it suffices to show that graph{^) is a countable inter- 
section of Borel sets. Recall that (P(N); C) can be viewed as a Boolean algebra 
whose atoms are singleton sets {n} for n G N. Let { : n G N} be a list of 
atoms of S/E and let 6„ G S' be such that !f'([a„]_E) = \bn\E'- Since ^ is an 
isomorphism of Boolean algebras, for any x £ S,y € S' 



Because <S, S' are Biichi structures, the relations a„ < a; and 6„ <' y are Borel 
and we have a Borel definition for iZ'([x]_e) = [y]E'- The lemma is now proved 

and we return to the proof of Theorem 6. 

The function G : Bo ^ C satisfies the hypotheses of the above lemma and 
hence is Borel. We define the map F = R' oG from Bq ^ 'P(N) to C. Note that 
since F is the composition of Borels, it is Borel. Moreover, for any X, F C N 
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